Technical Report on Stable Diffusion 3 Reveals Sora-like Architecture Details
The technical report on Stable Diffusion 3 reveals that SD3 adopts the Multimodal Diffusion Transformer architecture MMDiT. SD3 introduces reweighted flow technology to enhance performance. The report discusses the scaling research of SD3 and anticipates future performance improvements along with issues and suggestions regarding the text encoder.